Erratum for Token and Type Constraints for Cross-Lingual Part-of-Speech Tagging
نویسندگان
چکیده
This is an erratum for the paper titled “Token and Type Constraints for Cross-Lingual Part-ofSpeech Tagging” by Täckström et al. (2013). It revises the results with the coupled constraints presented in the last three columns of Table 2 in the aforementioned paper.
منابع مشابه
Token and Type Constraints for Cross-Lingual Part-of-Speech Tagging
We consider the construction of part-of-speech taggers for resource-poor languages. Recently, manually constructed tag dictionaries from Wiktionary and dictionaries projected via bitext have been used as type constraints to overcome the scarcity of annotated data in this setting. In this paper, we show that additional token constraints can be projected from a resourcerich source language to a r...
متن کاملCross-Lingual Discriminative Learning of Sequence Models with Posterior Regularization
We present a framework for cross-lingual transfer of sequence information from a resource-rich source language to a resourceimpoverished target language that incorporates soft constraints via posterior regularization. To this end, we use automatically word aligned bitext between the source and target language pair, and learn a discriminative conditional random field model on the target side. Ou...
متن کاملCross-Lingual Morphological Tagging for Low-Resource Languages
Morphologically rich languages often lack the annotated linguistic resources required to develop accurate natural language processing tools. We propose models suitable for training morphological taggers with rich tagsets for low-resource languages without using direct supervision. Our approach extends existing approaches of projecting part-of-speech tags across languages, using bitext to infer ...
متن کاملPredicting Linguistic Structure with Incomplete and Cross-Lingual Supervision
Täckström, O. 2013. Predicting Linguistic Structure with Incomplete and Cross-Lingual Supervision. Acta Universitatis Upsaliensis. Studia Linguistica Upsaliensia 14. xii+215 pp. Uppsala. ISBN 978-91-554-8631-0. Contemporary approaches to natural language processing are predominantly based on statistical machine learning from large amounts of text, which has been manually annotated with the ling...
متن کاملCross-Lingual Part-of-Speech Tagging through Ambiguous Learning
When Part-of-Speech annotated data is scarce, e.g. for under-resourced languages, one can turn to cross-lingual transfer and crawled dictionaries to collect partially supervised data. We cast this problem in the framework of ambiguous learning and show how to learn an accurate history-based model. Experiments on ten languages show significant improvements over prior state of the art performance.
متن کامل